Asymptotic Analysis of Self-Adjusting Contraction Trees

نویسنده

  • Pramod Bhatotia
چکیده

In this report, we analyze the asymptotic efficiency of self-adjusting contraction trees proposed as part of the Slider project [2, 3]. Self-adjusting contraction trees are used for incremental computation [1, 4, 5, 8]. Our analysis extends the asymptotic efficiency analysis of Incoop [6, 7]. We consider two different runs: the initial run of an Slider computation, where we perform a computation with some input I, and a second run for dynamic update where we change the input from I to I′ and perform the same computation with the new input. In the common case, we perform a single initial run followed by many dynamic updates. For the initial run, we define the overhead as the slowdown of Slider compared to a conventional implementation of MapReduce such as with Hadoop. We show that the overhead depends on communication costs and, if these are independent of the input size, which they often are, then it is also constant. Our experiment evaluation confirms that the overhead is relatively small. We show that dynamic updates are dominated by the time it takes to execute fresh tasks that are affected by the changes to the input data, which, for a certain class of computations and small changes, is logarithmic in the size of the input. In the analysis, we use the following terminology to refer to the three different types of computational tasks that form an Slider computation: Map tasks, Self-adjusting balanced tree (applications of the Combiner function for three different modes of operation for sliding-window computations), and Reduce tasks. Our bounds depend on the total number of map tasks, written NM, and the total number of reduce tasks written NR. In addition, we also take in account the total number of stages in self-adjusting balanced tree, denoted as NC. We write ni and nO to denote the total size of the input and output respectively, nm to denote the total number of key-value pairs output by the Map phase, and nmk to denote the set of distinct keys emitted by the Map phase. The number of stages in self-adjusting balanced tree is a property of sliding-window computation mode: append-only (NCA = O(nmk)), fixed-width window slides (NCF = O(nmk · ⌈log2(buckets)⌉)), and variable-width window slides((NCV) = ⌈O(nmk · log2(NM)⌉)). For our time bounds, we will additionally assume that each Map, Combine, and Reduce function performs work that is asymptotically linear in the size of their inputs. Furthermore, we will assume that the Combine function is monotonic, i.e., it produces an output that is no larger than its input. This assumption is satisfied in most applications, because Combiners often reduce the size of the data (e.g., a Combine function to compute the sum of values takes multiple values and outputs a single value).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

$G$-asymptotic contractions in metric spaces with a graph and fixed point results

In this paper, we discuss the existence and uniqueness of fixed points for $G$-asymptotic contractions in metric spaces endowed with a graph. The result given here is a new version of Kirk's fixed point theorem for asymptotic contractions in metric spaces endowed with a graph. The given result here is a generalization of fixed point theorem for asymptotic contraction from metric s paces to metr...

متن کامل

An Evaluation of Self-adjusting Binary Search Tree Techniques

Much has been said in praise of self-adjusting data structures, particularly self-adjusting binary search trees. Self-adjusting trees are most suited to skewed key-access distributions as the techniques attempt to place the most commonly accessed keys near the root of the tree. Theoretical bounds on worst-case and amortized performance (i.e. performance over a sequence of operations) have been ...

متن کامل

Fixed points for E-asymptotic contractions and Boyd-Wong type E-contractions in uniform spaces

In this paper we discuss on the fixed points of asymptotic contractions and Boyd-Wong type contractions in uniform spaces equipped with an E-distance. A new version ofKirk's fixed point theorem is given for asymptotic contractions and Boyd-Wong type contractions is investigated in uniform spaces.

متن کامل

The eccentric connectivity index of bucket recursive trees

If $G$ is a connected graph with vertex set $V$, then the eccentric connectivity index of $G$, $xi^c(G)$, is defined as $sum_{vin V(G)}deg(v)ecc(v)$ where $deg(v)$ is the degree of a vertex $v$ and $ecc(v)$ is its eccentricity. In this paper we show some convergence in probability and an asymptotic normality based on this index in random bucket recursive trees.

متن کامل

Asymptotic Cost of Cutting Down Random Free Trees

In this work, we calculate the limit distribution of the total cost incurred by splitting a tree selected at random from the set of all finite free trees. This total cost is considered to be an additive functional induced by a toll equal to the square of the size of tree. The main tools used are the recent results connecting the asymptotics of generating functions with the asymptotics of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1604.00794  شماره 

صفحات  -

تاریخ انتشار 2016